Ruimao Zhang

TouchGuide: Inference-Time Steering of Visuomotor Policies via Touch Guidance

Jan 28, 2026

Advances and Innovations in the Multi-Agent Robotic System (MARS) Challenge

Jan 26, 2026

CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion

Jun 17, 2025

WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning

Jun 09, 2025

DFVO: Learning Darkness-free Visible and Infrared Image Disentanglement and Fusion All at Once

May 07, 2025

RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints

Mar 20, 2025

DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation

Mar 14, 2025

Unlock the Power of Unlabeled Data in Language Driving Model

Mar 13, 2025

Semantic-Supervised Spatial-Temporal Fusion for LiDAR-based 3D Object Detection

Mar 13, 2025

NavigateDiff: Visual Predictors are Zero-Shot Navigation Assistants

Feb 19, 2025